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ABS^BACI • . • - . ^ 

The scientific iapXications and, practical 
applications of the Stein e^tiiator a^>proach for estiiating t^tte 
scores froi obserr^d scores are of 'aotentially gr^eat iiportance*^^ The 
-conceptttarl coiplexity- is n<%t-ittch oreater than i^^at re^ttired for sore 
conventional, regression aodels. ime^ eapirical Bayesian aspect ^ a:lloifs 
the exaiiner'to incorporate his/her. 9ifn degree c^f^prior information 
aboat selected examinees. ,.T]hi^approaph allows for a lore accurate.- 
estiiation "Of trtte scores^ vatk the corollary 'o£ ii^.ng fever t^st , ' 
itens to achieve those tras sco^e estimates. Efron^ aad;. Horris make 
the. point that there . is. little penalty for tt^ing* tbeV^l^? discttssed^ 
here* Becattse they cannot^give larger total mean sgttared error than 
the maxiittm likelihood estimate. This assttrance «ay be^ sttfficient 
nAasoh'for more carefttl examination of the tttility of. the Stein ^ 
es^ii^ator and its limited translation estimators as they apply to 
'behavioral and sociCL science researci;^ A*nttmerical exampl^e is given 
of prediction of a season's batting averages on the basis qf a sample' 
of abott^ 10 p^rc^iit o£ the times at bat. ""{Author/CTH) 



t 



;«* ' 'BepfOdttctdLons sttpplied by EDSS are the. best that can be made * 
♦ . ' from the original docttmept. — ' . ' 
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USING STBIN'^ ESTIMA^^OR TO, PREDICT UNIVERSE SCORES 
- FROM OBTAINED SCORES 



The ptu^se of t^jis |>&jper is to introduce and apply a recently 
developed statistical^ mettkod j^;c estiiiating true (popu^tJon) scores 
from observed (sample) sfcores. Provided th^t^t least three scores 
^ are availabK, this me^od overall will give tnore accurate true score 
e^tintates tiian the. ih/iyidual^inaxiimjin- livelihood bstintates (NttE) i - * 
regardless o£ the tru'p abillt:\e& o£ the examinees (Efron $ Morris, 
1977) - Th& method can be used without kii^owledge o£ the (Bayesian)' 
*prior distribution, ^d norm^Xit^ o£ the true scores being estimated 
need not be ^assumed. The ' tli^e^retical a]>d practical ^ implications d£ 
t^e method extend beyond psychological measurement' t;^ the very £ounda- 
tions o£ statistical in£erence »and have. caused some tumult in Jihat^dis** 
cipline during the past decade.^ ^* ' ' . . 

^ 'A 

i .\ -- ^ ^ . t 

HISTORICAL OVERVIEW 

■f i * 

4 * 

For the Gaussian distribution^ the- average Is the'best estimator 
o£ the true mean, 6* The avjerage is said €o be "unbiased" bebause no 
single value o£ & is £avore4 pver any other value/ That 'is t the ex- 
pected value o£'the average, ic, equals the true value o£ re0ar<3less 
o£ the:v2dufe of 6, How many unbiased estimates o£ © are there? An 
infinite number. But, none o£ %hem estimates d per£dctly. 'The expe^^ted 
squared erroi:! of estiiQation for the av^rag^ is lowe^ than that for^ any 
other linear or nonlinear and unbiased function of the data.r. > 

* ' ' \ * 

A departure from this classical approach assumes th^t unbiased esti-' 
mates of B are not the only methods by which to in^er population_values. 
For example, other possible estiiiiates' of d coiild be the-inedian, x/2, 2x, 
the mode, etc* All such estimators, can be compared through a risk func* 
tion,* >^ich is the expected value of the sqi^red erro^ for every possible 
y2Llue Plots of risk functions show that there is no estiinator 

with a risk function "that is everyvdiere lower than the risk function of 
the average, x, provided that a single mean is bein^ estimated . -But in * 
the more general case, a score is available from each of many examinees 
who have taken a test', for example, and it is th^ true score of each 
examinee that is to be' inferred. Thus, the HItE is merely- a specific 
.case" of the more genserai sittiation ^^Jiere the mean scores. (6's) are 
^sought for each examinee. ' ^ \ ' ^ , 

^ Theoretical work conducted by ptein (1955) and by James and stein 
(1961) concentrated on estimating sevefal unknown means, Jdirough' methods 
other than maximum likelihood estimation, ^e authors assumed t]J^t^ the 
means >re independent of e&ch other and* that the goodness of vatious \' 
estimators can be assessed by a risk function t' the sum 'Df- the expected . 
values of the squarelT errors ^f estimation for al'l of the individual 



J 



means* Alao, it is not necessary to assiung that the mesms being -es^ti** 
mat§d come from normal distributions. What James and Stein proved is 
that w^en^three^ or mare mesms (9 values) .are ^eing ^stimat^i it is % 
"4es3 than optimal solution ("inadmissibre") tO' estimate each 9 from its 
own average, That is, estimation tules cian be found with smaller ^otal 
i^sk regardless •of the values «f the true means (9*s) for ^ach examinee* 
As Egton and Mortis {1$75) express this accomplishmopt: 

-Charles Stein showed that ^t is possible to make a uniform ' 

improvement on t^e maximum* likeHhood estimate (Ml£) in . " 

t^rms o^ tota,! squared error xisk wh en v estimating several ^ ^ 
parameters, • . This achievement leads, immediately a 
uniform, nontriviai itaprovementj over the least] squares 
■ (Gauss-MarJcov) estimators for fihe parameters in the usual 
formulation of the linear model (p. 311) • I 



THE STEIN ESTIMATOR 



The following ^discussion ^serves as an introdudt-ion to the Stein 



estimator/ Assume that we have parameters 9 



.•9j^t )c >. 3, and 



that for each 9^ we observe an independent normal yariate x^ with meaii 

eaidi-x /Might be the 
Then x-.'V/ntft^, a /n)»* 



E- oc S 9, t and variance Var. (xj = 1, Note that 
' ^i ^ ^ * ' 9j[ 1 2 

mea;i of n independent ^sservaticiis ,'^i{9. p CT ) • 

and a change of scale tr£msf6rms a /n to the more convenient value of 

1; Therefore, the above assumptions often occur as a reduction from 

more' complicated 'situations to this canonicals' foml* 



jrhe primary objective for applying the set o 
■to -estimate the unknown vector of means 9, 9 B (9 
^ -^rfonaance of an estimation rule is assessed by -computing the sm of* 
=o»pc„.«t errors th.t 1. thi errl loi. £or that ..tl- 



estimation rules is 



idatipn rule* If 



{/j^, /2"'' is^an estd 



Ltion'rule, wh^re /. 



is;tme estimate of 9,,, then the squared error loss L(9, /) is defined 



as 



i = 1 

; i ■ • ■ 

In the case "of the maximum -likelihood esti 



sitor,^ or the sample 



medn; /'xifiiioted by V {t) , T Ix) E (/J (X) , (X) J . . & ) B (x^, , , ,x^) 



there is k constant risk, R, witfli R(9, 7^(X)) - R(9, p?)'= E 

2- , ' ■ * 9 i = 1 . 

(x^ - 9^) = k. (Note that E^^indicates t^e expectation over the dis- , 

tribution x,|9 ^^^■N(9,, 1) introduced above. observe*that E^(x, - 9,)^ 

= l^for each i, i - l,,,.,kV) . . . ♦ , . 



The Stein 'estimator may used t^ 'estimate 9. Define the Stein 
estimator, ?^(X) s'(/J(x), (X) , . . ./J(X) , k > 3, as foll<3ws: 



where \i B represents an initial guess at the tyue mean, 9^.^' 

and S is defined by s 5 Z(x, - M.)^. This estimator thus^has risk 

k . ■ ^ 

^ R(f 7'(xn E -I a\ " 9^)? < k^ V . 

' '4 =^ k - 2 + Z^{9. - Ujf 

for. all 9. If 9^ = for all i, the, risk Is- 2, which oomparea quite 
/ favorably to'k obtained for-the sample mean^ In any event;-. tfie-. risk 
for the, Stein jestimtor is less than that for the maximum likeiiho^^' 
estinnator, A discussion of how the ri^ for the Stein estimator- was. 
obtained, is presented in the last seo^tion cff . this paper, *) 



The Stein estimator has a very natural interpretation in^an .^piri-,/ 

cal Bayes context. If the^O. themselves are a sample fron^ prior ^iB/^ . 

ind ' ' * * ^ ' * 

tribution, 9, N(M., T ), i ^ l...k, then the Bayes estimate of - 

is the a ^oatex-iori mean of 9^ given the data, ^P^Jx^i}& def iije'4 iSy 

' \ A ■ ■ ■■ •'" ■* 

^ f J ^ • ' ■ '■ ■ 

2 ' ' ' \ 

In the erapixlpal Bayes situation, i is .uoknoyn', but-^it can be 

' ' ■ : ' .. . , 

estimated because marginally the' x. are indepeadtotly ni>nn^l with means 
* k : . ' 

and S = Z (x^ - y^)^ (1 + -t^) X.^ is a'.ic^l-square distrib- 



« £ (X - y ) (1 +-T ) Xi,/^:^^5? Xu is a'.ic^l-sj^uare 
j = 1 "^ M- ' ■ .' « 

ution with, k degrees of freedom. Given th^t k ^ 3, ^e/tinbiased estimate 

Ck. ^ 2) 1 / * - * . 

E ' a — -- ^- ^ is available,. , l.* - " 



k - 2 1. 
X ' Substituting — ^ — fot the untoown - g in the Bayes ' estimate 
it ■ / k-2\ 1 + T 

A results in + 1 1 — J (x^ - p^) , \^ich is the Stein estimator - 

Predicting Batting Averages Using the Stein Estimajjor . , 

• . _ ■ * . 

The following example J.s adapted from Efron and Morris (1975f 
1977)* Batting averages for ma^or league baseball j^layers,, based upon 
their first 45 times at bat, wer^ obtained. The objective was to pre- 
dict ^ach player's batting average for thte remainder 4>f the season. A 
cutoff after the first 45 tiroes at bat was chosen because that number ' 
was large enough to insure a satisfactory approximation to the binomial 
distribution by the normal distrilmtion and because the vast majority 
of "at bats" for the season would be ^estimat^d.' The model assumes that 
hits occur according to a l^ihomial distributlcfn with independence be- 
twe^^ players. (Requiring the same number of trials for all players^ 
n ^ TSf assures equal variances; however, the Stein estimator can also 
be used v^en variances are unequal, | see Efron and Morris, 197S.)' 

Let*Y^ be the batting average of -player if i = l^...k (k = 12) 
after the first.*45 ^imes at-^t. Assume that nY^ Bin(n, p^), 
i ^ l,...12, whe^^^^ i? the -true season batting average, i.e, , 



Because the variance of Yr^ depends, upon the mean, the arc-^in 

transformation for stabilizing the variance of a binomial distribution 

is applie*i: x, ="f^c(Y, ), where f (y) = n arc-3in(2y -1). It can 
1 HO in ^ _ 

be shown that this transformation results in x. having nearXy unit 
^variance independent of p « 

1 4 

The mesm 9^ of x^ is given by 9^ = fjj(Pj^) • Values of y^, p^, x^, 
9^, and p^ are listed for players 1- through 12/in Table 1. Batting, 
averages for the first 45^ timed at bat are listed in colunm 1. Each 
player received from 270 to"S90 additional "trials*" ^during the season. 
'The batting averages for 'this se^sona^ trial number are listed* in 
column 2, Rec£al that the objective here is to predict each player'? 
column 2 ("true," "population") vklue using the^ initially obtaihed 
column 1 f^sau^le") value. * 
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Table i / 
Exaiqple Using Batting Averages From 12 Players 



(1) 


(2) • 






(?) 


(6) 


• 

1 t 

1 


1 


. 1 * 


. ®i 


{i 


A 


.400 


• 346 


-1.35 


-2.10 


-2.49 


.319' 


.376 


,296 


-1.&6 


-2.79 


-2.60 


.*-.311 . 


.356- 


,276 


-1.97 


-3.11 


-2.71 

f 


,303 


.333 ' 


,222 


-2.28 


-3.96 . 


^ -2,82 


,296* 


.311 


.270 


-2-60 ■ 


-3,20 


-2,93. ; 

< 


,"287 


.289 


" _;263 


-2.92 


-3,32 


-3.03 ^ 


,283 
• 


.244 


-269 


-3.60 


-3.23 


-3.26 


* .265 


*222 


^^^303 


-3.95 


-2-71 


-3.40 


.258 


,222 


■■•>264 


-3.95 . 


. *^3-30 




.258 


,222 


' -226 


-3.95..' 


-3.89 


-3 .''40 


■ .258- 


,200 


.'28.5 


4r32 


-2-96 


-3.53 


-249 


,178 


.316 


-4.70 


-2-53 


-3-66 


* .241 ' 



Note * 'Listing of the MLE Scores and Estimated Universe Scores (columns 
I'^and 2), Score Transformations (columns" 3, 4, and 5), And the 
* Estimated Universe Score from using -Stein's estimator (column 6) 



values obtained upon application of^tfie arc-sin ^ansfonna- 
tion to the column J- batting .averages (obseryed 'scores) are shown in 
column 5, Similarly, the 6^ values obtained ty applying the arc-sin . 
transformation to the column 2, batting averages ^are shown ift column 4, 
The'Stein estimator- values that^ e$timate th^ 9. are shown in column 5, 



and the values obtained upon retransf orrtation' via the arc-sin ti^rtfs- 
fonnation are given ii) column 6; The following calculations are ^stam^JJ' 
pies of the type of computations required, \Note that the ccMnputatio^^ 
are not at all compl€>x* _ * ^ * ' . - ^ 

For i = 2, f {Y_) ^ 45 arc-sin (2 ( ,978) - 1)= -1.66. Therefore, 

n , , 

x« = -1,66, ai^d is entered in column 3, Similarly, '9 = f {pJi = 
*i ' * * fi n ^ 

45 arc-sin (2(,298) - 1) , This value is given in column 4,^^Values 
for x^f x^f • *,x^2 ^^x' ®j""®i2 obtained through similar , 
substitutions « 

* 

The basic equation for the Stein estimator f^, which allows us to 

estimate the ith component of 9, is slightly different from the expres-. 

sion introduced previously. We estimate the initial guess ]} ^ y^A 

by X = Sx^/k, which shrink^ all x^ toward }C The resulting estimate 

of* the 1th component 9, of 9 is '"given by 
, . ^ * ■ i ■ 

-/J(X) = 5r + ^1 - ^^-^^y^i where v =.J:(x^ and Jc - a = 

tk - 1) - 2*f because one parameter is estiInated^^ . 

in the empirical Bayes case, the appropriateness of- this formula- 

' * ' " ' k - 3 

tion follows if X is used as the unbiased estimate for ]i and — r — 'as 

the unbiased estimate for Y~^~t^*^ Therefore, in the case of the *exam- 
pie data provided in TaCble 1, 

■ ■ . ■ ' r 

X = Zx,A = <-^-^^^ ^...^ (-4.70) ;,3^^^^ ■ 
The value for X. may in' turn be used to compute V: 

v'=Xtx. - Xi^ R.(-l*a5 - (a.lO)J^ (-4.70 - t-3,10))^ = lj3.81. 
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The itein es;timates for 9^* * .9^^ 'are*der£ved.by substilpiting the^ 

' obtained values for % V the computation£(J equationi 

♦ f ■ • f ♦ , 

7f (X) = -3aO+ fl. - "an '(x. - (-3.10)) ='.3^01t!-* 2,02,' 
1 ' ' \ . 13.81 ^ . 1 , . 

f " ■ - . ■ 

^ s For lexairiple, (X). ,350{-i;35) - 2^02 =''2,*49, This Value and 
the Values for ^/^^ listed in column 5 of Tabrle jL\ These values^ 
* are finally retrsmsjoofed to dbt^in the estiijates of the "true score" -^t 
average for each- player in column 6* . - 

The total squared prediction error for / (X) is* defined as 
(/J - 9^)^ ^2 "^12*^ ^ 4^»040, ThU value is obtained by , - 

subtractinj^the column 4 yalue fXQni the cqlumn 5 value for each of * , 
the 12 players, sqiiaring the differences, smH sumntiiig, * ■ - 

In the Case of the san^Ie mesm, X, the total squared j)Vediction 
error is defined as 2{x - 9J - 15^135, This .v^lue *is obtained by* 
subtracting the column 4 Value from thp c^umn 3 value for each of the 
H ' 12 ^layecs, squaring the differences, anS summing* ' * / * 

The adequacy of Stein's estimator relative to the sample mean may 
be iieterniined by computing their relative efficiency* ^The efficiency 
of* Stein's estimator relative \o *l;he 'sahple mean is defined as 



In this example, ^e efficiency is 3,746, In other words, Stein*s esti 
mator is nearly four times as '*ef f icient" in- predicting ^'universe" or 
"triie" scores from observed (sample) scores as is the HLE* 
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Limited Translation Estiaaators 



Stein estimators* achieve uriifozmly la^r* aggregate riaJc thaiftlie 

HLE (sample mean) / as shown abo\re/ but may result^ in. increased risk to 

individual cOTponents of in partio^ilar, the Stein estimator i^niay ^do 

poorly in estimating with very larger or Very small values* OAierefore 
1 • ^ 

eveiv^though'/ (X) provides better' prediction in the -aggregate, dhe may— 
grpssly err with IndjividuaX* components rA»d^srrable ^comprcMaise vould 
be to have bpth good aggregate anii good individual^ pr^iction, \^ere 
improved individual prediction would occ^r with minimal, if any, loss 
'in aggregate prediction'eff ^ci^ncy* l^is tradeoff may ^ achjleved by 
using '*limiteid translation estimators" that reduce individual risk for 
outlying cases and result in minimal loss in aggregate prediction. 

Limited translation estii^ators are introduced to reducd' potentially 

la^ge mean squared prediction exxors associated with individual c^po** * 

nents* Shrinkage of ' 7^ values toward values is accoanplished through 

the estimate .0 <, s < i, of 9, \ (Here, 7^ = tocj 1] 7] ) . ' 7® is 

1 ^ * '^l-'^ i''i"»i i*^ i 

defined to be as close to /i" as ppsstble, sc^ long as it does not differ 

Pck - 1) (k 3>*s T ' n ^ 
by more 'than, j-r ^^—^ 10^ _ j^ts) . steuidard deviations of x^ 

from x^, _ ^(s) is a constant, obtained from a table of limited 

translation estimators (Efron a Morris, 1972)", 

Data from the baseball example will now be used to illustrate*^ t^e 
application o,f limited translation estimators* Kotice in Table' 1 that 
the first player*^ season average far*exceeds the season averages of ^ ' 
the remaining" players, an example of^ jan outlying case* In bas^bsCll 
example, k = 12, and V was found to be Xpgpk* l^l^eref ore,^^l3y obtaining^ 



values for.D.^ , (•S) and D (,8) frcmfiS^fron and M orgfca— 14»72) table, 

* 9 ^ ' ff^ * 

it is ^ound tAat /* [%) may differ Iw no i*>re than iTS from x aiid ■ 

/* (X) may differ by no more *SI5 from x, *^ In dther words, by apply-. 

9 ' i"! ^ \ 1 

ing /; it means, that if \S , - x, <,*75, then is retained; but if 

- x^|> *75, fl- ip set' egual to the value differing from x^ by *75* 
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■ " ^. ' '1 .0 

Tablq 2 contains, values for the 12 players fdr pjf^Tfj,.Pj> , 
.'a ' ' .9 ■ .8 - 1 1 M . 1 ■ 

and'^] . ,Value'3 for, pj and pV are obtained as follpws'^ Cons£der 

the first player, = ••L.SS^ and = *2.49j ther^ore fx^ * 

1.14 > rf?7^W^£|2-^^=^ - /Jl = 1^14 > .56.^ 



..I ■ 



Thus^ 

/] = -1.91.> These values are i^tetranslated to obtain ^' ^ .346, and -' 

^ ' ■ , . , ' . ' - 

-pjv- ^ ;360. Hotice that p-v*^ '.346.' 'Theteforje* pitovides' better 
^prediction for: this. ^ "-'^ ^ ^'^ " ' ^'^ 



^i T ''i- 

individual- l^iSui or ^'^y ^^'iaso n6te that is.' 

clos^ to the pj^ ,yalue-!t^naii - All three prediction estimates, are 

closer than the MLE. value of Y . .400 '. In the cetse of the second - 

player, thcqgh^ the ^, value became farther removed^ rqia p, as^thS ^ 

. , ' * t ^ . ■ "^"-^ — ' — ^ 

vajrue s decreases from,! to .9 to^ .8.^ l^ereforp, the translations. 

.aq^e increasing the squared prediction error for;, that player rather 

thaB decrea^sing it. in the case of the fifth individual, j/. - X; | < .75 

and {/^ ^ ^^1 ^ *^^' the Wtisaat^ value remains the same under tfans" 

lations s = .9 'and s « .8. The estimated v^iiue vdll not chan^& .junttl ^ 

j/^ x^j <' .33. Ih thl^ par6i<!ular exan^le, the translation is in- 

creasing the error for many^individual components by increasing the dif-* 

ference between the estimate and the true score. - 



Recall that the efficient/ of Stein 's '.estimator, / (X), relfitive 
to the sasfgle mean was defined to be ' ^ . 

= 3.-746. ■ * ■ :* ~ 



SI/, - V' 



The efficiency of the lin&ted translation estimator'/' ^XX) relative to 
the%ample mean is defined t^be ' • ' 

. 2K- V , ' 



J 8 
which €(iual3 3.077.^ .Similarly, , |Qr f \ (X) the relative efficiency 



equeds >2'.462; Therefore, 'in> this, e^tample jr^(X). has tH& greatest '^ffi-^ 
ciency ^f the' three estimators^ / , /' ^;<and /.V^- ■ . ' * 
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Table 



Batting Averages an^ Their Estimates 



^1 t 



.346 


.400 


<-1.35V ' 


.319 


'{*2.49)- 


. .346\i-2>iJ) 


.360 (-1.91) 


1 .14 ■ 




.378 


(■a.66)' 


.311 


{-2.€0) 


.324 (-2. 4i) 


.3:38 (-2.22) 


.94 ■ 


.276 


.356 


(-l'.97) ' 


.303 


(-2.71) 


.303 (-2.11) ' 


.316 (-2.53) 


' .74 


.222 


.333 


{-2'.28> 


.296 


■(-2.82) 


.296' (-^.82) 


.296 (-2.82) 


-.54 


.270 


.311 


{r2.60) 


.288 


{^2.93) 


.286' (-2.93) 


.288 (-2.93) 

.1 ' 
,282 (-3.t)3) 


.33 




.289 


(-2.92) , 


.282 


(-3.03) 


.282 (-3.d3) 

'r 


.11 . 

* 




.244 


(-3.6&>-* , 


.265 


(-3.28). 


.26*5 (-3.28)" 


.265,jE»-3.28). 

t 


.32 


.303 


.222 


(-3.95) 


. 258 


C-3.40) 


-.258 (-3.40) 


.258 (-3.40) 


.55- 

X* 


.264 


.222* 


(t3.95) 


.258 


(-3.40) 


.258 (t3.40) 


* .258 (-3. 40) * 


,55 


' .226 


.222 


(-3.95J . 




(-3.40) 


.258 (-3-Uo) 


.258 (-3. 4^0) 


.55 ' 


.285| 


.200 


(-4.32) 


.249^ 


(-3.53) 


* 

.246 (-3.57) 


.234 (-3.76) 


.79- ^ 


«.316 , 


.178 


(-4.70) 


.*24i 


t-3^66) 


/ .222 {-3.9S) 


VjlO (-4.14) 


1.04 ' 



Relationship Between Aqqxteqate and Individual Oontponent Mean^ 
Squared Prediction BtrorB . : . ^ ^ 

prlpr infornatloB a]fbut pertain 'examlnaes pa^ "^ft^ed'to produce 
modified estimates of their true ^r universe' scor6ts;,?r^eDK this sense , 
tlie estimator functions as an empirical. Baye^ian *^^dic@mn model.^ This 
prdcedure is m^st effectively lised ^en the examinee has ughly credible 
Infonnation about .specific' exAmiiiees^ vhich is tantamount^o having a 
high ^rior. prob^ility/ in the usual Bayeslan sense. ^ As ^^^[ig^ult/ 'for 



10 



these particular examinees, the fit Of test scores. to "true*', scores may. 
be improved* considerably by use of a limited transiatiojrtfesti^tor. 
Howe^verr ^ven though the limited' tratfslation^estijnator yi'^l^^a slower 
aggregate' squared prediction error for the set of , examinees as a whole 
than does tiie MLE ^sample mean)-, it may reduce ;th* overall ,efficienc:y 
from that of by increasing the mean squared prediction errors fdr 

other examinees in the population. TJierefore, ovefall' ef foJcie^icy, in- 
diviTdual^ squared prediction i^rror^ andT prior information available* on . 
some examinees must all be considered jsinrultaneously to determine^- what ^ 
translation, ' if any,, is to be . performed * , , 

If there i^ uniform prior information about all examiiaees ih ^e 
score distribution, it may be best to maximize" the aggregate 'eff^cientsy^^^ 
l£ no inj^onnation about true scores, is available, tt is' impossible, to 
assess which individuals ^have the greatest squared prediction e^^rbrgV ;/ 
associated with them. Therefore, a good strategy ^would be to achieve-;^^^^, 
maxijnal aggregate ef£iciency^ . 



r 



If prior informatibn is concentr^ed, at the extremes of the score 
distribution, translations may be applied to bring the predicted. score 
more in line with the type of score that -might be expected, based upon 
prior information* In accomplishing this reduction, however, one must 
evaluate its effect on aggregate efficiency. First, the individual ^ 
scopes cani:)e adjusted until they are^ in line \jith prior expectations, 
and the. resulting aggregate efficiepoy ther^ eyaXuated. ■ Or, one <ian . 
focus on attain in^^maximunt aggregate efficiency and then notice how"" 
the scores of examinees for whom prior information is available are ^\ 
influenced by minor translations.' A major decision is to determine 
at what point scqre^'f itting f or particular examinees b^cones counter- 
productive or inefficient, because minimal additional in^rovements are 
achieved at a^ high cost to the oyer^il aggregate efficiency. 

A case in p>oint is when the "true"^score does^not fall between the 
MLE^nd / (IE), 43ut when /^(Ic) falls between^the true Bco^,and^the san^le 
mean. Shrinkinq the'di^fexence betweei]^ the san^le mean and /^{5)-j3y ap- 
p^licatioh of a limited translation estimator, f^{%, ^actually increases' 
"^the squared prediction error for that examinee* IJhe reasoning is the 
same when all prior information on an examinee do^s not fall between, the ^ 
MLE (sample mean) and f^{% . . ' ^ , 

/There are also several methodofoqical considerations -in relatting 
obtained and "true^* score estimates^ Initial trials may underestimate 
a '*'true" score if ^ the learning curve has nolt yet reached asymptote ifi 
this "number of test trials. Likewise, fatigue'^ from the last group of 
test items could produce an 'underestimate of the ^'true*' l^core* 




\, tony factors need to.be considered in relating observed scores. and 
true .scopes/ ii) 'applying. ,li^^ed translations, dn optimizing good indi- 
vidual as<3 gck}d aggregate predictidh, and in using prior information on 
sp€ci^iq. examinees productively*. The usefulness of St€rin*s estimator 

1i>ehavjlo]?al and educational research lar^gely depend^ upon hev well 
thB'se"con^i<iera,ti6ns are 'addressed* * - 



^ ' ' ' ^ '\ ^ SbMMftHS 

. The scientific implications and practical applications of the Stein 
estimator^ approa<;h for estimating true scores from observed secures are , 
of potentially, great, ijnportance. The conceptual complexity is not much 
gre^te^ t^mn that required for more conventional regression models* The 
^pirical, Bayesian^as^i^Ct allows the examiner to incorporate his/her own" 
degree. ^rior information about, selected examinees. This g^roBch . ^ 
allows £6r,a more accurate estimation ojE true^careS/ with the corollary, 
of *usin4 fe^^ ,test items to achieve those true score estimates* . ^f ron 
and Mo^riK(1975) make the point that "there is little penalty^ for using " 
the rules juScussed here because they cannot give large total. mean squared 
error' thaifxhe' MI£* * ^ *'* This ass^irance may be a sufficient reason fo^ 
more: careful ^examination of the utility of the stein estimator and its 
limited translation estimators as they app];/ to behavioral and social 
science 'r^searc^. " - ^ . • 
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